| Codebook for arcWeeklyTableCombined | |
|---|---|
| NetSense Study, iCeNSA, University of Notre Dame | |
| Created by Matthew J. Chandler | |
| | |
| General Description | |
| This dataset provides comprehensive information about mobile phone communication events from the first three years of the NetSense study (August 2011 through August 2013). Additional participant-reported data on vertex and edge attributes exist in other files; this file contains only observed mobile phone communication data. The unit of observation is an arc from sender to receiver, where both senders and receivers are either known or assumed to be individual persons, identified by unique numeric codes. Reciprocal arcs (from receiver to sender) are recorded on separate rows, and there are no self-loops. Every arc involves at least one study participant, identifiable by a 5-digit ID code; all other persons have 6-digit ID codes. Most of the data for each arc are recorded in weekly increments, which capture the dynamics of communication over time, but the first several variables contain summary information about the arc. Week 1 begins on the official start date of the study, August 15, 2011. Week 0 captures the few observations made before the official start date. The weekly data are censored at week 107 at the latest (ending September 1, 2013), but observation continued through August 2014 (binned in this dataset in week 108+). A minority of arcs are censored earlier: if the participant dropped out of the study early, the censor date is set at two-thirds of the total number of days the participant remained in the study. The censoring allows judgments about whether arcs decayed before the censor date: if the last observed communication between a sender and receiver occurred before the censor date, that observation is treated as the date the arc decayed; if it occurred after the censor date, the arc is considered still effectively present as of the censor date. The full list of sender-receiver pairs gives the comprehensive directed network, while the weekly increments show the network dynamics. Note, however, that only mobile phone communication involving study participants is observed, so the largest complete network these data represent includes only the arcs in which both vertices are participants (easily filtered by their 5-digit ID codes). | |
| | |
| Variable Name | Variable Description |
| Sender | Unique ID code for the sender of communication (i.e., the origin vertex of the arc) |
| Receiver | Unique ID code for the receiver of communication (i.e., the terminal vertex of the arc) |
| FirstDate | The date of the first observation of communication, given as yyyy-mm-dd |
| LastDate | The date of the last observation of communication, given as yyyy-mm-dd |
| DurationDays | The number of days spanning from the first to the last daily observations of communication |
| FrequencyDays | The total number of days on which at least one communication event was observed |
| FirstWeekActive | The index number of the week during which the first observation of communication occurred, where week 1 begins on August 15, 2011 |
| LastWeekActive | The index number of the week during which the last observation of communication occurred, where week 1 begins on August 15, 2011 |
| DurationWeeksActive | The number of weeks spanning from the first to the last weekly observations of communication |
| FrequencyWeeksActive | The total number of weeks during which at least one communication event was observed |
| FirstWeekEffective | The index number of the week during which the arc is considered to have effectively formed, where week 1 begins on August 15, 2011 (equivalent to FirstWeekActive) |
| LastWeekEffective | The index number of the week during which the arc is considered to have effectively decayed, where week 1 begins on August 15, 2011 (right-censored; see CensorWeek) |
| DurationWeeksEffective | The number of weeks spanning from FirstWeekEffective to LastWeekEffective (the arc is considered effectively present for all weeks in between) |
| CensorWeek | The index number of the week upon which the observation of communication activity is right-censored (the maximum value possible for LastWeekEffective) |
| TotalFrequency_0 | The total number of communication events (voice and SMS) observed before August 15, 2011 |
| VoiceFrequency_0 | The number of voice communication events observed before August 15, 2011 |
| SMSFrequency_0 | The number of SMS communication events observed before August 15, 2011 |
| VoiceLengthSum_0 | The sum of the length (in seconds) of voice communication events observed before August 15, 2011 |
| SMSLengthSum_0 | The sum of the length (in characters) of SMS communication events observed before August 15, 2011 |
| ActiveDummy_0 | A binary indicator of whether any communication events were observed before August 15, 2011 |
| EffectiveDummy_0 | A binary indicator of whether the arc is considered effectively present before August 15, 2011 (equivalent to ActiveDummy_0) |
| ReciprocatedDummy_0 | A binary indicator of whether reciprocal communication (from receiver to sender; either voice or SMS) was observed before August 15, 2011 (“NA” if never reciprocated) |
| ReciprocatedWeight_0 | The number of reciprocal communication events (either voice or SMS; counted in 1-to-1 pairs: sender-receiver/receiver-sender) observed before August 15, 2011 (“NA” if never reciprocated) |
| NonreciprocatedWeight_0 | The number of unreciprocated communication events (either voice or SMS; sender-receiver only, net of reciprocal events) observed before August 15, 2011 (“NA” if never reciprocated) |
| WeeklyTotalFrequency_w | The total number of communication events (voice and SMS) observed during week w, where w ranges from 1 to 107 |
| CumulativeTotalFrequency_w | The cumulative total number of communication events (voice and SMS) observed up to and including week w, where w ranges from 1 to 107 |
| WeeklyVoiceFrequency_w | The number of voice communication events observed during week w, where w ranges from 1 to 107 |
| CumulativeVoiceFrequency_w | The cumulative number of voice communication events observed up to and including week w, where w ranges from 1 to 107 |
| WeeklySMSFrequency_w | The number of SMS communication events observed during week w, where w ranges from 1 to 107 |
| CumulativeSMSFrequency_w | The cumulative number of SMS communication events observed up to and including week w, where w ranges from 1 to 107 |
| WeeklyVoiceLengthSum_w | The sum of the length (in seconds) of voice communication events observed during week w, where w ranges from 1 to 107 |
| CumulativeVoiceLengthSum_w | The cumulative sum of the length (in seconds) of voice communication events observed up to and including week w, where w ranges from 1 to 107 |
| WeeklySMSLengthSum_w | The sum of the length (in characters) of SMS communication events observed during week w, where w ranges from 1 to 107 |
| CumulativeSMSLengthSum_w | The cumulative sum of the length (in characters) of SMS communication events observed up to and including week w, where w ranges from 1 to 107 |
| ActiveDummy_w | A binary indicator of whether any communication events were observed during week w, where w ranges from 1 to 107 |
| ActiveAge_w | The age of the active arc as of week w, i.e., the running count of consecutive weeks during which any communication events were observed up to and including week w, where w ranges from 1 to 107 |
| ChangeDummy_w | A binary indicator of whether ActiveDummy_w and ActiveDummy_(w-1) are different, where w ranges from 1 to 107 |
| CumulativeChangeFrequency_w | A running count of the number of times ActiveDummy_w changed up to and including week w, where w ranges from 1 to 107 |
| EffectiveDummy_w | A binary indicator of whether the arc is considered effectively present during week w, where w ranges from 1 to 107 |
| EffectiveAge_w | The age of the effective arc as of week w, i.e., the running count of consecutive weeks during which the arc was considered effectively present up to and including week w, where w ranges from 1 to 107 |
| ReciprocatedDummy_w | A binary indicator of whether reciprocal communication (from receiver to sender; either voice or SMS) was observed during week w, where w ranges from 1 to 107 |
| ReciprocatedWeight_w | The number of reciprocal communication events (either voice or SMS; counted in 1-to-1 pairs: sender-receiver/receiver-sender) observed during week w, where w ranges from 1 to 107 |
| NonreciprocatedWeight_w | The number of unreciprocated communication events (either voice or SMS; sender-receiver only, net of reciprocal events) observed during week w, where w ranges from 1 to 107 |
| TotalFrequency_108 | The total number of communication events (voice and SMS) observed after September 1, 2013 |
| VoiceFrequency_108 | The number of voice communication events observed after September 1, 2013 |
| SMSFrequency_108 | The number of SMS communication events observed after September 1, 2013 |
| VoiceLengthSum_108 | The sum of the length (in seconds) of voice communication events observed after September 1, 2013 |
| SMSLengthSum_108 | The sum of the length (in characters) of SMS communication events observed after September 1, 2013 |
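
The week indexing and right-censoring rules described in the General Description can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the dataset: the function names `week_index` and `last_week_effective` are hypothetical, and treating LastWeekEffective as the smaller of LastWeekActive and CensorWeek is one reading of the censoring rule. The earlier censor date for participants who dropped out is not reproduced here, because the codebook does not say how the two-thirds point is rounded.

```python
from datetime import date

STUDY_START = date(2011, 8, 15)  # week 1 begins on this date, per the codebook
MAX_WEEK = 107                   # last regular weekly bin (ends September 1, 2013)


def week_index(d: date) -> int:
    """Map a calendar date to a week bin (hypothetical helper).

    Returns 0 for dates before the study start, 1..107 for the regular
    weekly bins, and 108 for anything after week 107 (the "108+" bin).
    """
    if d < STUDY_START:
        return 0
    week = (d - STUDY_START).days // 7 + 1
    return min(week, MAX_WEEK + 1)


def last_week_effective(last_week_active: int, censor_week: int) -> int:
    """Assumed reading of the censoring rule: an arc last observed before
    the censor week decays at that last observation; otherwise it is
    treated as still present as of the censor week."""
    return min(last_week_active, censor_week)


print(week_index(date(2011, 8, 15)))  # first day of week 1
print(week_index(date(2013, 9, 1)))   # last day of week 107
print(last_week_effective(last_week_active=108, censor_week=107))
```

Under this reading, CensorWeek acts as a ceiling on LastWeekEffective, which matches its description as the maximum possible value for that variable.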
This codebook describes the variables in the legacy arcWeeklyTableCombined dataset.
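
As a rough sketch of how the variables might be used, the snippet below keeps only the arcs in which both vertices are study participants (5-digit ID codes) and builds the column name for a given week. It rests on two assumptions that the codebook does not confirm: that ID codes are stored as integers without leading zeros, and that the `_w` suffix in the weekly variable names is replaced by the plain week number (for example `WeeklySMSFrequency_12`). The helper names and the example rows are made up for illustration and are not taken from the data.

```python
MAX_WEEK = 107  # weekly variables are defined for w = 1..107


def is_participant(id_code) -> bool:
    """True for 5-digit ID codes, which the codebook assigns to participants."""
    return len(str(int(id_code))) == 5


def participant_arcs(arcs):
    """Keep arcs whose Sender and Receiver are both study participants."""
    return [
        arc for arc in arcs
        if is_participant(arc["Sender"]) and is_participant(arc["Receiver"])
    ]


def weekly_column(stem: str, w: int) -> str:
    """Build a weekly column name such as 'ActiveDummy_3' (assumed naming)."""
    if not 1 <= w <= MAX_WEEK:
        raise ValueError(f"week must be between 1 and {MAX_WEEK}, got {w}")
    return f"{stem}_{w}"


# Made-up rows for illustration only; these IDs are not real study codes.
example_arcs = [
    {"Sender": 10001, "Receiver": 10002},   # participant to participant
    {"Sender": 10001, "Receiver": 200001},  # participant to non-participant
    {"Sender": 200002, "Receiver": 10002},  # non-participant to participant
]

complete_network = participant_arcs(example_arcs)
print(complete_network)
print(weekly_column("WeeklySMSFrequency", 12))
```

Filtering on both endpoints gives the largest network in which every arc could have been observed, since communication between two non-participants is never recorded.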